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The ability to extract word-forms from sentential contexts represents an initial step in 
infants' process toward lexical acquisition. By age 6 months the ability is just emerging 
and evidence of it is restricted to certain testing conditions. Most research has been 
developed with infants acquiring stress-timed languages (English, but also German and 
Dutch) whose rhythmic unit is not the syllable. Data from infants acquiring syllable-timed 
languages are still scarce and limited to French (European and Canadian), partially revealing 
some discrepancies with English regarding the age at which word segmentation ability 
emerges. Research reported here aims at broadening this cross-linguistic perspective by 
presenting first data on the early ability to segment monosyllabic word-forms by infants 
acquiring Spanish and Catalan. Three different language groups (two monolingual and one 
bilingual) and two different age groups (8- and 6-month-old infants) were tested using 
natural language and a modified version of the HPP with familiarization to passages and 
testing on words. Results revealed positive evidence of word segmentation in all groups 
at both ages, but critically, the pattern of preference differed by age. A novelty preference 
was obtained in the older groups, while the expected familiarity preference was only 
found at the younger age tested, suggesting more advanced segmentation ability with an 
increase in age. These results offer first evidence of an early ability for monosyllabic word 
segmentation in infants acquiring syllable-timed languages such as Spanish or Catalan, 
not previously described in the literature. Data show no impact of bilingual exposure in the 
emergence of this ability and results suggest rapid gains in early segmentation for words 
that match the rhythm unit of the native language. 



Keywords: word segmentation, syllable-timed languages, natural speech, rhythmic unit, preference pattern, 
infants 



INTRODUCTION 

The identification of possible word-forms within sentential con- 
texts represents an initial step in infants' process toward lexical 
acquisition. Extracting word units from the input and detecting 
repetitions of these units in different contexts is considered a basic 
skill related to early vocabulary construction. Research has already 
shown an associative link between these early skills and later lan- 
guage outcomes (Newman et al., 2006; Junge et al., 2012; Singh 
et al., 2012). Characterizing the emergence of word segmentation 
ability is, thus, important in relation to the early building and 
growing of lexical knowledge. More specifically, exploring this 
emergent capacity in infants exposed to languages with different 
rhythmic structure offers the opportunity to identify differential 
features in the segmentation strategies used by infants, as well 
as possible variation in its developmental time-course. Finding 
evidence of variation in the time course for word segmentation 
might ultimately be useful to account for possible differences 
in early lexical acquisition processes from a cross-linguistic per- 
spective. The present research addresses this issue by exploring 
early word segmentation abilities in infants exposed to Spanish 



and Catalan, two Romance languages whose rhythmic properties 
differ from the properties of languages that have already been 
analyzed in previous word segmentation studies. 

The ability to segment and recognize unfamiliar words from 
fluent speech was first explored in the pioneering research devel- 
oped by R W. Jusczyk and R. N. Aslin in 1995. In their seminal 
paper they showed that 7V2 -month-old, but not 6-month-old 
English-learning infants, were able to extract short, monosyllabic 
word-forms from natural speech passages containing repetitions 
of two different target words (Jusczyk and Aslin, 1995). Whether 
familiarized to lists of words and then tested with passages, or 
familiarized to passages and then tested on words, infants in both 
testing conditions showed the capacity to extract and recognize 
possible "lexical" units (word-forms) and they did so by retain- 
ing rather detailed information about the phonetic form of these 
word candidates. Even though words in that experiment were 
short, simple monosyllabic items (bike, dog, cup, feet), infants 
younger than 7 months of age did not succeed in the task. Follow- 
up work explored the ability to segment bi-syllabic words and 
it was shown that for words following the predominant stress 
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pattern in the language (i.e., the trochaic or strong/weak -SW- 
stress pattern in the case of English), this ability was also present 
by 7Vi months of age (Jusczyk et al., 1999a). Taken together these 
results were interpreted as an indication that prosodic informa- 
tion, here based on the predominant stress pattern of content 
words in English (around 90% of content words begin with a 
stressed syllable, according to Cutler and Carter, 1987), could 
be used by infants to successfully find word-form units in con- 
nected speech. This prosodic hypothesis (defined as the Metrical 
Segmentation Strategy -MSS- in Jusczyk, 1999) could explain 
both the results from the monosyllabic and the trochaic word 
segmentation experiments, as items in the monosyllabic study 
were strong syllables with full vowels. The importance of prosodic 
information in early word segmentation was first described in 
these early studies and subsequent work contributed to give sup- 
port to the relevant role of prosody in infants' dealing with 
the word segmentation problem (Johnson and Jusczyk, 2001; 
Johnson and Seidl, 2008, but see Thiessen and Saffran, 2003; 
Pelucchi et al., 2009 for an alternative position to the prosodic 
bootstrapping approach). 

If we admit that segmentation strategies based on prosodic 
information derive from the specific rhythmic properties of the 
native language, then these strategies might differ in populations 
acquiring languages with different rhythmic structure. Research 
with young infants has shown that they are sensitive to global 
prosodic features contained in the linguistic input (Bosch and 
Sebastian-Galles, 1997; Nazzi et al, 2000a). These prosodic fea- 
tures may offer first cues to segment the input into linguistically 
relevant units such as clauses and phrases within which word- 
form units can eventually be extracted (Hirsh-Pasek et al, 1987; 
Nazzi et al, 2000b; Soderstrom et al, 2003; Seidl, 2007; Seidl and 
Cristia, 2008). But beyond these global prosodic cues, attention to 
the specific rhythmic properties of the native language and detec- 
tion of the specific rhythmic unit operating in that language can 
lead to the emergence of segmentation strategies most adequate to 
extract words from fluent speech. This is actually the hypothesis 
behind the so-called early rhythmic segmentation proposal devel- 
oped by Nazzi et al. (2006). Cross-linguistic differences regarding 
the type of rhythmic strategy and rhythmic unit used for seg- 
mentation can then be expected for languages differing in their 
rhythmic properties. 

A gross partition of the languages based on linguistic rhythm 
has traditionally identified three broad rhythmic types, i.e., stress- 
timed, syllable-timed and mora-timed languages, each of them 
associated to a different underlying rhythmic unit (Abercrombie, 
1967; Ladefoged, 1975). Germanic languages such as English, 
Dutch, or German would belong to the first type, having the 
trochaic stress unit at the basis of their rhythmic structure; 
Romance languages such as French, Italian or Spanish would 
be examples of the second type, having the syllable as the basic 
rhythmic unit and, finally, languages like Japanese would belong 
to the third type relying on the sub-syllabic mora as the basic 
unit of rhythm. Initially, these typologies were considered to 
derive from the notion of isochrony between successive units 
(syllables, feet or morae depending on the type of language), 
however, subsequent measurements obtained from different lan- 
guages questioned this idea. Linguistic rhythm is more accurately 



described as an alternation of elements: vowels and consonants 
at the most basic level and syllables (stressed and unstressed) 
and feet at subsequent levels (Nespor et al., 2011). Factors such 
as variability in syllable structure complexity and the degree of 
vowel reduction are considered key elements in accounting for 
language rhythm differences. The study of durational correlates of 
such phonological phenomena has become the focus of research 
aimed at identifying the specific properties underlying rhythmic 
differences between languages. 

Different rhythm metrics have been used in studies analyzing 
limited sets of cross-linguistic material. Ramus et al. (1999) mea- 
sured duration of vocalic and consonantal intervals in the speech 
signal. By plotting the percentage of total utterance duration 
comprising vocalic intervals (%V) against the standard devia- 
tion of consonantal intervals (AC), they succeeded at adequately 
grouping the eight languages under study (English, Dutch, Polish, 
Spanish, Italian, French, Catalan, and Japanese) into the three tra- 
ditional rhythm typologies. Low et al. (2000) proposed pairwise 
variability indices (nPVI and rPVI, normalized and raw, respec- 
tively) in an attempt to better capture the durational differences 
between successive vocalic and consonantal intervals. However, 
only measurements from the nPVI-V scores could group sepa- 
rately English, German, and Dutch on the one hand, and Spanish 
and French on the other, failing to place Japanese in a different 
area. Interestingly, languages considered more difficult to classify 
in terms of rhythm structure, such as Catalan and Polish (Nespor, 
1990), showed intermediate positions in the PVI space. More 
recently, White and Mattys (2007) using rate-normalized metrics 
of vocalic interval variation (VarcoV) plotted against %V mea- 
surements, showed again that "stress-timed" Dutch and English, 
and "syllable-timed" French and Spanish could be distinguished, 
but at the same time their analysis revealed that the notion 
of a strictly categorical distinction between these two rhythmic 
typologies was far from perfect, with Dutch and French placed 
in a more intermediate position between stress-timed English 
and syllable-timed Spanish. In general, results from these met- 
rical studies offer empirical support for the existence of broad 
rhythmic distinctions between languages, but critically, they also 
provide a more nuanced perspective on the nature of rhythmic 
differences that goes beyond the initial notion of three distinct 
language typologies (see White et al., 2012). From this perspec- 
tive, differences in the emergence of the word segmentation ability 
may be found not only when comparing languages tradition- 
ally ascribed to a different rhythmic typology (e.g., stress-timed 
English and syllable-timed French), but also for languages tra- 
ditionally grouped under the same typology (e.g., syllable-timed 
French and Spanish, or stress-timed English and Dutch). 

What evidence can be found about cross-linguistic differ- 
ences in the skill to segment words from fluent speech early in 
development? A review of the early word segmentation literature 
immediately reveals that research has been developed mostly in 
English and cross-linguistic data are still scarce. As already men- 
tioned first evidence of word segmentation with natural language 
material came from English-learning 7Vi -month-old infants and 
restricted to specific types of words such as monosyllabic and 
trochaic items (Jusczyk and Aslin, 1995; Jusczyk et al., 1999a). 
Evidence for this ability at an earlier age (6 months) was later 
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attested by using a slightly different methodological approach, 
in which highly familiar words (infants' own names) preceded 
the target monosyllabic units (those used in Jusczyk and Aslin, 
1995 study) in familiarization passages (Bortfeld et al., 2005). In 
this situation, familiar words were probably acting as anchors and 
facilitated segmentation of the adjacent elements, which could 
not otherwise be easily extracted. Without additional cues to seg- 
mentation, English-learning infants are just beginning to segment 
simple word forms from fluent speech around 7 months of age. 
It is interesting to note that evidence of segmentation is shown 
by a familiarity preference pattern, whether familiarization be 
based on passages or word lists. The direction of the preference 
has been linked to task demands (Hunter and Ames, 1988). The 
specific direction of the preference pattern in word segmentation 
tasks (novelty versus familiarity) and its changes during develop- 
ment can be explained from factors such as the duration of the 
familiarization, stimulus complexity, degree of similarity between 
familiarization and test stimuli, and more generally, from exper- 
tise acquired with age (Thiessen et al, 2005). Thus at younger 
ages, when segmentation ability is just emerging, a familiarity 
preference is to be expected, as found by Jusczyk and Aslin (1995) 
in 7Vi month-old infants. 

For studies examining the early emergence of segmentation 
ability in stress-timed languages other than English, only some 
data from German and Dutch are available. Segmentation of 
unstressed closed-class elements has been shown in German- 
learning infants from 7Vi months on, but not before (Hohle and 
Weissenborn, 2003). The procedure involved familiarization with 
isolated words and test on passages and, as expected, a familiar- 
ity preference was found paralleling Jusczyk and Aslin's results, 
but this time on unstressed material (although from an acousti- 
cal perspective, closed-class grammatical morphemes experience 
less vowel reduction in German than in English). According to the 
rhythmic segmentation hypothesis these unstressed monosyllabic 
elements should have been difficult to segment at that age. It was 
argued, however, that their special status and potential role in the 
acquisition of morpho-syntactic knowledge might have favored 
successful segmentation at an early age. 

Evidence from Dutch-learning-infants revealed a slightly later 
emergence of the segmentation ability (at 9, but not at 7 
months of age) using HPP and trochaic words (Kuijpers et al, 
1998). Further research replicated 9-month-olds' segmentation of 
trochees in this language and confirmed that the ability to extract 
words from fluent speech is not dependent on familiarity with 
the phonetic structure of the input, as English-learning infants 
also succeeded in the task with Dutch material (Houston et al., 
2000). The same strategy could be exported to successfully extract 
word units in another stress-timed language with similar rhyth- 
mic properties. In spite of the slightly older age of the Dutch 
participants, segmentation evidence resulted from a familiar- 
ity preference. Could rhythmic differences between English and 
Dutch, as described by White and Mattys (2007) using VarcoV 
and %V measurements, have impacted speed of segmentation? 
This remains an open question that deserves further analysis. 
Unfortunately, no data from monosyllabic word segmentation 
in Dutch are available, which might have revealed successful 
segmentation at an earlier age than that obtained for trochees. 



The above mentioned studies involve languages tradition- 
ally grouped under the stress-timed category, whose rhythmic 
unit is not the syllable. Will monosyllabic word segmentation 
be facilitated early in development if the rhythmic unit of the 
ambient language is the syllable? And will segmentation of bi- 
syllabic words initially be delayed, being first segmented as two 
independent syllabic units and only later as whole units? Infant 
segmentation data from syllable-timed languages are actually lim- 
ited to French, although evidence obtained from two different 
French dialects (European and Canadian) is available. 

Monosyllabic word segmentation in French has not been 
extensively explored and only data available from a dissertation 
indicate that Parisian 7Vi month-olds could successfully seg- 
ment monosyllabic CVC items, using HPP with familiarization 
to words and test on passages (Gout, 2001). A familiarity pref- 
erence was also obtained there 1 . No data from infants tested at 
a younger age were gathered, so we do not know if monosyl- 
labic words in a syllable-timed language are actually easier to 
extract from fluent speech than similar words in stress-timed 
languages. What we actually know, however, is that bi-syllabic 
word segmentation in French is not easily attained, at least 
according to data from infants exposed to the European French 
dialect who could only succeed at successfully segmenting iambs 
by 16 months of age (Nazzi et al., 2006). Data from French- 
learning infants exposed to the Canadian dialect did not replicate 
European French results, however. No "delayed" segmentation 
ability was identified in Canadian French-learning infants com- 
pared to a group of Canadian English young learners tested 
at 8 months on two-syllable word segmentation (iambic and 
trochaic patterns, respectively): both groups succeeded, although 
segmentation strategies certainly differed and were adjusted to the 
properties of the native language, so no group was able to segment 
the items in the other language (Polka and Sundara, 2012). 

Because Nazzi et al.'s (2006) and Polka and Sundara's (2012) 
work involved a considerable amount of experiments to more 
thoroughly explore segmentation abilities in the populations 
under study, some relevant findings about the segmentation of 
the syllabic components of the iambic items could be identified. 
Clear evidence of final syllable segmentation was obtained at 12 
months and some evidence of initial syllable segmentation could 
also be found at the same age in European French infants suggest- 
ing that a syllable-based segmentation procedure is applied before 
bi-syllabic words can be successfully segmented as whole units 
(Nazzi et al., 2006). Similarly, although at an earlier age, results 
in Canadian French also revealed some ability to segment each 
isolated syllable of the iambic target words, although the transi- 
tion from an initial syllable-based segmentation to a successful 
whole bi-syllabic word segmentation could not be established in 
that research as only groups of 8-month-olds' were tested (Polka 
and Sundara, 2012). Relevant for our own research on mono- 
syllabic word segmentation, the Canadian study found opposite 
response patterns when familiarization involved whole iambic 



In a different study focused on the segmentation of monosyllabic verb forms 
by Canadian-French learning infants, positive evidence of segmentation, also 
based on a familiarity preference, was obtained at 1 1 months of age, but not 
earlier (Marquis and Shi, 2008). 
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words or only their syllabic components. The novelty preference 
pattern obtained when syllables instead of whole words were pre- 
sented in the familiarization phase suggests that syllables might 
be more easily identified because they match the rhythmic unit in 
this language. 

Taken together, and compared to data from segmentation in 
stress-timed languages, research done in French reveals important 
cross-linguistic differences, not only in the emergence of segmen- 
tation abilities, but also in the strategies used, which reflect the 
rhythmic nature of the language of exposure. However, French 
results are not clear-cut especially due to the non-trivial timing 
difference in the emergence of segmentation abilities found when 
both dialects are compared. Even if differences can be attributed 
to factors derived from specific properties of these dialects or 
the testing material, the fact is that behavioral results so far have 
only partially confirmed a hypothetic ease to segment monosyl- 
labic words or track syllabic elements in fluent speech, as it could 
be expected if the syllable is the rhythmic unit for segmenta- 
tion in syllable-based languages (but see Goyet et al., 2010 for 
a re-assessment of syllabic segmentation using ERP measures). 
Studying early segmentation abilities in infants acquiring other 
syllable-timed languages could shed more light on the early rhyth- 
mic segmentation hypothesis and help clarify results obtained 
so far. 

Spanish and Catalan have also been traditionally grouped 
under the syllable-timed typology, although some metric dis- 
tinctions have been described in studies comparing the rhythmic 
properties of these two languages. Some authors consider Catalan 
a rhythmically-intermediate language between the stress-timed 
and syllable-timed typologies (Nespor, 1990). Catalan, but not 
Spanish, has vowel reduction, a property that can affect syl- 
labic rhythm and determines differences in the type of vowels 
that can appear in unstressed syllable positions (Prieto et al., 
2012). Catalan allows for more complex consonant clusters in 
coda position, while syllabic structures are simpler in Spanish. As 
a consequence, %V metrics have been found to be significantly 
lower in Catalan than in Spanish. However, higher variability in 
vocalic interval duration (i.e., higher VarcoV scores that charac- 
terize languages with vowel reduction) has not been confirmed, 
with Catalan even showing lower variability scores than Spanish 
according to Payne et al.'s (2009) work. More recent research 
has corroborated that vowel reduction in Catalan does not seem 
to substantially increase variability in vowel interval duration 
(Prieto et al, 2012). In sum, while some rhythmic differences 
between Catalan and Spanish exist, the classification of Catalan 
as a rhythmically-intermediate language between syllable-timed 
and stress-timed typologies remains controversial. Although an 
in-depth and systematic comparison between Spanish, Catalan, 
and French rhythm metrics is not available, measures from dif- 
ferent studies involving different sets of material would suggest 
a non-overlapping distribution of these three "syllable-timed" 
languages over the %V and VarcoV rhythmic plane (White and 
Mattys, 2007; Payne et al, 2009). Among these three languages, 
Spanish would show the highest %V and the lowest VarcoV 
scores, while French would show the opposite tendency (i.e., 
higher VarcoV and lower %V scores), and Catalan would be 
placed in an intermediate position, probably more similar to 



Spanish in terms of vocalic interval variability (VarcoV), as the 
above mentioned studies have revealed. Given these differential 
metrical characteristics, Spanish and Catalan are good language 
candidates to extend word segmentation studies in syllable- 
timed languages other than French and explore infants' early use 
of a syllabic segmentation strategy. In particular, the compari- 
son between Catalan-learning and Spanish-learning groups can 
reveal if the (minor) rhythmic differences between these two lan- 
guages have an impact on the emergence of the segmentation 
ability. 

To sum up, the present research was designed to explore the 
emergent ability to segment monosyllabic word-forms by infants 
acquiring Spanish, Catalan, but also both languages simultane- 
ously from birth. To our knowledge, word segmentation abilities 
in bilingual infants have begun to be explored only in English- 
French environments, with preliminary data available so far 
showing bi-syllabic word segmentation ability in both languages 
by 8 months of age (Polka and Sundara, 2003). The inclusion of 
bilingual participants in this research, exposed to languages tra- 
ditionally grouped into the same rhythmic class, but nonetheless 
showing some minor differential rhythmic properties, can con- 
tribute to clarify the actual impact that bilingual exposure can 
have on the emergent ability to extract words from connected 
speech, when segmentation strategies derived from each of the 
ambient languages are likely to converge. 

In the present research, evidence of an emergent segmenta- 
tion ability will be explored using the HPP technique, in line 
with the work just reviewed coming from both stress-timed 
and syllable-timed languages. However, we have selected the 
less frequent order in this type of experiments, involving pas- 
sages first and test on lists of isolated words. Because similar 
segmentation effects were obtained independently of the test- 
ing order in the original Jusczyk and Aslin's (1995) study, we 
opted for the passages-first order to promote segmentation spon- 
taneously arising from a more natural context and to avoid 
initially biasing participants to attend to a specific word or syllabic 
unit. 

In our first experiment we analyzed 8-month-olds' ability to 
segment words that match the rhythmic unit of their native 
language. No great difficulties were expected for monosyllabic 
word segmentation in our Catalan and Spanish participants, but 
given the limited data available in French and the slightly delayed 
emergence of the segmentation ability, even for the syllabic com- 
ponents of the bi-syllabic words, found by Nazzi et al. (2006), 
evidence from the three groups tested at 8 months would be most 
informative about the timing of this emergent ability in languages 
different from French but having syllables as the basic rhythmic 
units. 

In a second experiment we wanted to further explore if evi- 
dence of monosyllabic word segmentation could be found at an 
earlier age (6 months) in syllable-timed languages compared to 
stress-timed ones, due to the direct match between the target 
elements (monosyllabic words) and the rhythmic unit for seg- 
mentation (the syllable). If confirmed, results would not only 
give support to the early rhythmic segmentation hypothesis, but 
they would also suggest the need to take into account additional 
differences in the rhythmic properties of languages traditionally 
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grouped into the same rhythmic typology, as these properties 
might lead to differences in the timing of the emergence of 
the segmentation ability. Recall that the earliest evidence for 
monosyllabic word segmentation in French comes from a single 
experiment with 7 1 /2-month-olds showing a familiarity preference 
response pattern (Gout, 2001; Gout, unpublished dissertation). 

The ultimate aim of the present study is to set the ground- 
work for future research exploring the emergence of the ability 
to segment multi- syllabic word-forms both in Spanish- and in 
Catalan-learning infants. Knowledge about the ability and the 
segmentation strategies used to extract short, simple monosyl- 
labic units from connected speech can offer valuable information 
to better understand the specific problems that segmenting bi- 
and tri-syllabic words in syllable-timed languages with variable 
stress can pose to the infant learner. 

EXPERIMENT 1: WORD SEGMENTATION AT 8 MONTHS 
PARTICIPANTS 

A total of 54 healthy full-term infants with no history of hear- 
ing or vision problems were included in the sample divided into 
three groups (N = 18 in each group) according to the language/s 
spoken in their environment (Catalan only, Spanish only or both 
languages on a daily basis). Mean age of the infants in the Catalan 
monolingual group was 8 months 4 days (range: 7 months, 15 
days-8 months, 22 days); in the Spanish monolingual group was 
8 months 6 days (range: 7 months, 19 days-8 months, 25 days) 
and in the bilingual group was 8 months 6 days (range: 7 months, 
13 days-8 months, 15 days). No significant between-group age 
differences were found (F < 1). Participants were assigned to 
different language groups based on the information obtained 
through a questionnaire to the parents that offered an estimate of 
the daily and weekly amount of exposure to the languages in their 
environment (Bosch and Sebastian-Galles, 2001). To be included 
in a monolingual group, participants had at least 75% of regular 
exposure to either Catalan or Spanish, while a more balanced dis- 
tribution between these two languages was required for inclusion 
in the bilingual group. Mean percentage of exposure to Catalan in 
the Catalan monolingual group was 92% (range: 75-100%) and 
to Spanish in the Spanish monolingual group was 93% (range: 
80-100%). From the 18 infants in the bilingual group, seven had a 
higher amount of exposure to Spanish than to Catalan (66-34%) 
and they were tested on Spanish material. The remaining infants 
had a higher exposure to Catalan than to Spanish (63-37%) and 
were tested on Catalan material. Fourteen additional infants were 
also tested but excluded from the final sample due to fussiness 
or crying leading to incomplete testing (4), very short looking 
time — below 1 s — to trials in the test phase (6), preterm birth (1) 
and experimental error (3). 

STIMULI 

Target Spanish and Catalan monosyllabic words with full vow- 
els and a CVC (bus, mar, gol -"bus," "sea," and "goal"-) or 
CCVC (tren -"train") structure were selected because of their 
cognate status in the languages under study (for simplicity, 
from now on we will refer to all target words as having a 
monosyllabic CVC structure). Target words were nouns that are 
infrequent in the first receptive and expressive vocabularies of 



1 -year-olds acquiring Spanish, Catalan or both (Aguila et al., 
2005). 

Four passages were created, formed by six different sentences 
each with the target word appearing once per sentence in different 
positions (twice in initial, twice in medial and twice in final sen- 
tence positions). Because the experimental design involved two 
different conditions (half of the participants were familiarized 
with "train-bus" passages -TB-, and the other half with "gol-mar" 
passages -GM-), parallel sentences were used to make conditions 
equivalent (see Table 1). Adjacent syllables to the target words 
(from words preceding or following the target nouns) were con- 
trolled so that no specific syllabic sequences appeared repeatedly 
within the passage. Mean duration of the sentences in the pas- 
sages was 2.3 s and total length of the passages was adjusted 
to 18 s by inserting short pauses of about 700 ms between sen- 
tences. Passages had 45-46 syllables each and especial care was 
taken to build equivalent passages for the Spanish and Catalan 
versions of the material. Sentences were not always perfect trans- 
lations because length of the words tends to be shorter in Catalan 



Table 1 | Catalan and Spanish sentences forming the passages used 
in the familiarization phase. 

"Tren" passage {train) 

Catalan: Un tren te sis o set vagons. Veig un gran tren des d'aqui. El tren 
no s'atura mai. A la foto hi ha aquell tren. Mira aquest cotxe a prop del tren. 
Arriben en tren molt d'hora 

Spanish: Un tren tiene seis vagones. Veo un gran tren desde aqui. El tren 
nunca esta parado. En la foto esta aquel tren. Mira este coche junto al tren. 
Llegan en tren manana 



"Bus" passage (bus) 

Catalan: Un bus va venir de sobte. Esperava el primer bus. Recordo aquest 
bus cada dia. El bus no era massa bo. M'encanta el seu bus de cartro. 
Somiare amb el meu bus 

Spanish: Un bus llega de repente. Esperan otro bus. Recuerdo aquel bus 
cada dia. El bus no era largo. Me encanta su bus de carton. Sohare con 
este bus 



"Mar" passage (sea) 

Catalan: Un mar te milers de peixos. Veig un gran mar des d'aqui. El mar 
no s'atura mai. A la foto hi ha aquell mar. Mira aquest cotxe a prop del mar. 
Arriben per mar molt d'hora 

Spanish: Un mar tiene muchos peces. Veo un gran mar desde aqui. El mar 
nunca esta calmado. En la foto esta aquel mar. Mira este coche junto al 
mar. Llegan por mar mahana 



"Gol" passage {goal) 

Catalan: Un gol va venir de sobte. Esperava el primer gol. Recordo aquest 
gol cada dia. El gol no era massa bo. M'encanta el seu gol de talo. Somiare 
amb el meu gol 

Spanish: Un gol llega de repente. Esperan otro gol. Recuerdo aquel gol 
cada dia. El gol no era bueno. Me encanta su gol de tacon. Sohare con 
este gol 

In the experimental design half of the participants were familiarized to "Tren- 
Bus" (train-bus) passages and the other half to "Mar-Gol" (sea-goal) passages. 
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and we wanted to keep with the same number of syllables per 
sentence. In spite of minor meaning differences between Spanish 
and Catalan sentences (irrelevant to study word segmentation 
in early infancy), the final passages represent equivalent ver- 
sions of the material in terms of number of syllables and total 
length. 

Word lists involving 12 isolated productions of each of the four 
target words were also needed for use in the test phase of the 
experiments. Six different tokens of the same noun repeated twice 
in a randomized order formed each of the four experimental word 
lists in this study. Total length of the word lists wasl8 s as lists were 
built by adding silence to the end of the stimulus to reach a 1.5 s 
duration (mean length of the words in the lists in each language 
is reported in Table 2). 

Passages and words were produced by a highly proficient 
Spanish-Catalan bilingual female speaker and she was instructed 
to use infant direct speech, as if speaking to a young child. 
The stimuli were recorded in a single session in a comfort- 
able, sound attenuated booth equipped with an omni-directional 
microphone. Utterances were recorded directly onto a Pentium- 
Ill PC using Sound Edit (version 2.99) software. The operating 
system was Windows XP. Online monitoring ensured optimal 
sound quality recording. 

Finally, to ensure similarity between the materials for each lan- 
guage, acoustic analyses on target words extracted from the pas- 
sages and words in the lists were conducted using Praat software 
(version 5.3.22). Mean values of word duration and amplitude 
(for the entire word) and pitch (calculated on the vocalic por- 
tion of the word) were obtained and are reported in Table 2, both 
for Catalan and Spanish material. As expected, statistical analyses 
only revealed significant differences in duration between words 
extracted from the passages and words produced in isolation. 



Table 2 | Acoustic measures of target words in passages 
(familiarization) and lists (test) for Catalan and Spanish material. 



Passage words 
Mean (SD); range 



List words 



Mean (SD); range 



DURATION (ms) 

Catalan 380 (43.4); 331-488 

Spanish 378 (59.5); 290-536 

p n.s. 
AMPLITUDE (dB) 



Catalan 
Spanish 

P 

PITCH (Hz) 

Catalan 
Spanish 
P 



71.9 (4.2); 65.4-80 
72.9 (3.1); 68.4-78.4 
n.s. 



596 (78.2); 474-763 
599 (120); 422-863 
n.s. 

70.5 (1.2); 68.2-72.5 
71.4 (3.3); 61.2-77.2 
n.s. 



n.s. 
n.s. 



256 (61); 174-388 
274(57); 194-396 
n.s. 



267 (40); 201-372 
283 (56); 201-406 
n.s. 



n.s. 
n.s. 



Measurements include whole word duration (ms), followed by amplitude (dB) 
and pitch (Hz) calculated on the vocalic portion of the words. Significant differ- 
ences are also indicated. 

Results oft(23) tests: ***p < 0.007; n.s. p > 0.05. 



Amplitude and pitch measurements were found equivalent both 
within each language and also between languages (see details in 
Table 2). 

PROCEDURE 

The familiarization-preference procedure with familiarization to 
passages and test on lists of words [as in Experiment 4, by 
Jusczyk and Aslin (1995)], was implemented in this research. 
The testing took place in a three-sided test booth, but instead 
of a frontal and two lateral lights typically used in the HPP 
set-up, a frontal display involving three computer screens and 
two concealed loudspeakers below the left and right monitor 
screens was used [this set-up had already been satisfactorily 
used by Bosch and Sebastian-Galles (2001), to test for language 
discrimination in young infants]. Babies were seated on their 
parent's lap facing these three frontal monitor screens. Parents 
were listening to music through headphones throughout the 
whole experimental session. An experimenter, inside the test- 
ing room but out of the view of the infant, watched infants' 
looking behavior through a TV monitor, controlled trial pre- 
sentation and recorded online infants' attention. By pressing 
and releasing the mouse button the experimenter could regis- 
ter the direction and duration of the infant look fixation toward 
the side screen involved in the presentation of the audio files 
in each trial. Online information about total attention time in 
each trial for each participant was stored and could later be 
checked against the results from off-line coding of the record- 
ings to assess reliability of the measures and detect experimenter 
errors. 

The experimental session began with a familiarization phase 
in which TB or CM passages were presented on alternating trials 
until the infant accumulated 45 s of attention time to each pas- 
sage. Because of this criterion, infants could hear the target words 
in sentential contexts about 18 times each. Immediately after 
completing familiarization, the test phase began. It involved 16 
test trials (four target word lists presented in four blocks). Words 
within each list were randomly presented and the order varied for 
each participant. At the beginning of each trial the central mon- 
itor displayed a flashing green circle to direct infants' attention 
toward the center. Immediately afterwards, one of the two lateral 
monitors displayed a flashing red circle to capture infant's atten- 
tion and as soon as the infant oriented toward that side screen 
the audio files were presented. Auditory material was played until 
trial completion (18 s) or until the infant ceased to look in that 
direction for more than two consecutive seconds. In case of trial 
interruption, passage presentation was not resumed in the next 
trial, but started again from the beginning. Looks away below 
2 s duration did not interrupt trial presentation but time away 
was not included in the final amount of fixation for that specific 
trial. 

DESIGN 

Half of the infants were familiarized to passages containing the 
target nouns "tren-bus" (TB condition) and the other half to pas- 
sages containing the target nouns "gol-mar" (CM condition). In 
the test phase all participants were presented with the four target 
word lists. 



Frontiers in Psychology | Language Sciences 



March 2013 | Volume 4 | Article 106 | 6 



Bosch et al. 



Gains in segmenting fluent speech 



RESULTS 

Separate analyses were run, one on changes in attention time from 
the first to the last trial in the familiarization phase, and the other 
on attention time to familiar vs. novel words in the test phase. 
Regarding changes in attention during familiarization to the pas- 
sages, a repeated-measures ANOVA with looking time as depen- 
dent measure, language group (Catalan, Spanish, bilingual) and 
familiarization condition (TB or GM) as a between-group factors 
and trial (first, last) as repeated measures revealed a highly sig- 
nificant effect of familiarization trial 48) = 82.9; p = 0.0001; 
r) 2 = 0.63], but no effect of language group or condition and no 
significant interactions (all F's < 1). All three groups thus showed 
similar decays in attention from the first to the last trial in the 
familiarization phase when they were presented with the pas- 
sages containing repetitions of two target words (mean attention 
time to first and last trial was, respectively, 15.1 s and 9.9 s in the 
Catalan monolingual group, 16 s and 8.2 s in the Spanish mono- 
lingual group and 15.2 s and 10.5 s in the bilingual group). Paired 
f tests conducted separately for each group on attention time 
to first and last familiarization trial confirmed the similarity in 
behavior [Spanish monolingual: t(\j) = 6.1, p = 0.0001; Cohen's 
d = 2.0]; [Catalan monolingual: t(if) = 4.7, p = 0.0001; Cohen's 
d = 1.41]; [bilingual: t (17) = 4.9, p = 0.0001; Cohen's d = 1.21]. 

We also analyzed if groups differed in the number of trials 
to reach criterion. A one-way ANOVA on number of trials in 
the familiarization phase as dependent measure and condition 
(TB vs. GM) and language group (Spanish, Catalan and bilin- 
gual) as between-subjects factors revealed no significant effects 
(F's < 1) or interaction [F {2 , 48) = 1-77, p = 0.18; n 2 = 0.69]. 
These results suggest that duration of the familiarization and 
participants' looking behavior in this phase can be considered 
equivalent. 

To assess word segmentation, mean attention time to familiar 
vs. novel words in the test phase was computed for each partic- 
ipant (see Figure 1). A repeated-measures ANOVA with mean 



looking time as dependent measure, language group (Catalan, 
Spanish, bilingual) and familiarization condition (TB or GM) as 
a between-group factors and type of word (familiar, novel) as 
repeated measures was run. Results only revealed a highly sig- 
nificant main effect of type of word [F(i, 48) = 21.6; p = 0.0001; 
T) 2 = 0.31], with no language group or condition effects (both 
F's < 1) and no interactions. Paired t tests conducted separately 
for each group on mean attention time to familiar vs. novel word 
lists confirmed the presence of significant differences in atten- 
tion to the two types of words, thus indicating that segmentation 
of monosyllabic words had been reached [Spanish monolingual: 
familiar words M = 6.6 s (SD = 3.2) and novel words M = 7.9 s 
(SD = 2.7);f(i 7) = -2.7, p = 0.015; Cohen's d = 0.41]; [Catalan 
monolingual: familiar words M = 6.1 s (SD = 2.8) and novel 
words M = 7.5 s (SD = 3); t (17) = -2.6, p = 0.019; Cohen's d = 
0.48]; [bilingual: familiar words M = 5.7 s (SD = 2.3) and novel 
words M=7s (SD = 2.7); f (17) = -2.8, p = 0.011; Cohen's 
d = 0.51]. Interestingly, however, the pattern of preference that 
was obtained at 8 months across all three groups was not the 
expected one, as a familiarity preference rather than novelty is 
typically observed in segmentation tasks using natural language. 
The monosyllabic nature of the target items, the fact that they 
were presented twice in sentence-final position in the familiariza- 
tion passages, together with the use of IDS and a sufficiently long 
familiarization phase are possible factors that might explain this 
unexpected novelty preference, which is more likely to be found 
when the task is relatively easy and can be completed within the 
temporal limits established by the procedure. 

EXPERIMENT 2: WORD SEGMENTATION AT 6 MONTHS 
PARTICIPANTS 

As in Experiment 1, a total of 54 healthy full-term infants with 
no history of hearing or vision problems were included in the 
sample divided into three groups (N = 18 in each group) accord- 
ing to the language/s spoken in their environment (Catalan only, 
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FIGURE 1 | Mean attention time (s) and standard error to familiar and novel words presented in the test phase, for the 8-month-old infants, grouped 
by language environment (monolingual Catalan, monolingual Spanish and bilingual). 
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Spanish only or both languages on a daily basis). Mean age of 
the infants in the Catalan groups was 6 months 6 days (range: 
5 months, 22 days-6 months, 29 days); in the Spanish mono- 
lingual group was 6 months 4 days (range: 5 months, 19 days-6 
months, 27 days) and in the bilingual group was 6 months 7 days 
(range: 5 months, 19 days-6 months, 27 days). No significant 
between-group age differences were found (F < 1). Following 
the information from the initial language questionnaire to par- 
ents, participants were assigned to different language groups. 
Inclusion criteria were the same as in Experiment 1. Mean per- 
centage of exposure to Catalan in the Catalan monolingual group 
was 91% (range: 80-100%) and to Spanish in the Spanish mono- 
lingual group was 95% (range: 75-100%). From the eighteen 
infants in the bilingual group, 12 had a higher amount of expo- 
sure to Spanish than to Catalan (65-35%) and they were tested 
on Spanish material. The remaining six had a higher exposure 
to Catalan than to Spanish (64-36%) and they were tested on 
Catalan material. Twenty-nine additional infants were also tested 
but excluded from the final sample due to fussiness or crying lead- 
ing to incomplete testing (22), very short looking time — below 
1 s — to trials in the test phase (5) and experimental error (2). 

STIMULI, PROCEDURE. AND DESIGN 

Same as in Experiment 1. 

RESULTS 

Separate analyses were also run on data from these younger-age 
groups to explore attention behavior in the familiarization phase 
(expected decay of looking time) and possible differences in atten- 
tion time to familiar vs. novel words in the test phase, as indicative 
of successful word segmentation. 

Concerning attention behavior during familiarization, a 
repeated-measures AN OVA with looking time as dependent mea- 
sure, language group (Catalan, Spanish, bilingual) and familiar- 
ization condition (TB or CM) as a between-group factors and 
trial (first, last) as repeated measures revealed a highly signif- 
icant effect of familiarization trial [F(i i 48) = 42.9; p = 0.0001; 
r) 2 = 0.47], and no effect of language group or condition and no 
significant interactions [familiarization trial x language group: 
F(2, 48) = 1.15; p = 0.32; n 2 = 0.04; familiarization trial x con- 
dition: F(i i 48) = 1.7; p = 0.19; r\ 2 = 0.03; familiarization trial x 
language group x condition: F < 1]. All three groups showed 
a decrement in their attention time during familiarization to 
passages containing repetitions of target words (mean attention 
time to first and last trial was, respectively, 15.5 s and 12.3 s in 
the Catalan monolingual group, 15.5 s and 10.6 s in the Spanish 
monolingual group and 14.9 s and 9.1 s in the bilingual group). 
Paired t tests conducted separately for each group on attention 
time to first and last familiarization trial confirmed the similar- 
ity in this behavior [Spanish monolingual: t(i?) = 4.1, p = 0.001; 
Cohen's d = 1.24]; [Catalan monolingual: t(yj) = 2.9, p = 0.009; 
Cohen's d = 0.85]; [bilingual: t (17 ) = 4.2, p = 0.001; Cohen's d = 
1.29]. We also analyzed if groups differed in the number of trials 
to reach criterion. A One -Way ANOVA on number of trials in the 
familiarization phase as dependent measure and condition (TB 
vs. CM) and language group (Spanish, Catalan, and bilingual) as 
between-subjects factors revealed no significant effect of language 



group (F < 1), but a significant effect of condition [F(i, 48) = 
5.59, p = 0.02; r| 2 = 0.1], with no significant group x condi- 
tion interaction [Fp, 48) = 1. 1 1, p = 0.33; r) 2 = 0.04]. Follow-up 
t tests revealed that mean number of trials to reach criterion in the 
TB condition (9.1) was significantly higher than in the CM condi- 
tion (7.5) [f (26 ) = -2.56, p = 0.017; Cohen's d = 0.67]. Further 
t tests by language groups indicated that only in the monolingual 
Catalan group the number of trials to reach criterion was sig- 
nificantly different by condition [t(g) = —2.8, p = 0.02; Cohen's 
d = 1.5]. Differences by condition did not reach significance 
in the other two groups [monolingual Spanish: t(g) = —1.04, 
p = 0.32; Cohen's d = 0.26; bilingual group t < 1]. 

To analyze word segmentation ability in the younger groups, 
mean attention time to familiar vs. novel words in the test phase 
was computed for each participant (see Figure 2). A repeated- 
measures ANOVA with mean looking time as dependent measure, 
language group (Catalan, Spanish, bilingual) and familiariza- 
tion condition (TB or GM) as a between-group factors and 
type of word (familiar, novel) as repeated measures was run. 
Results only revealed a highly significant main effect of type 
of word [F ( i_ 48) = 18.9; p = 0.0001; r\ 2 = 0.87], with no lan- 
guage group or condition effects [language group: F(2, 48) = 
1.9; p = 0.15; r\ 2 = 0.07; and condition F < 1] and no sig- 
nificant interactions (F's < 1). Paired t tests conducted sepa- 
rately for each group on mean attention time to familiar vs. 
novel word lists confirmed the presence of significant differ- 
ences in attention to the two types of words, thus indicating 
that segmentation of monosyllabic words had successfully been 
reached at this early age [Spanish monolingual: familiar words 
M = 8.6 s (SD = 3.1) and novel words M = 7.1 s (SD = 2.7); 
f( 17) = 2.2, p = 0.035; Cohen's d = 0.48]; [Catalan monolingual: 
familiar words M = 7.1 s (SD = 2.8) and novel words M = 6.1 s 
(SD = 2.7); f(i 7) = 4.3, p = 0.0001; Cohen's d = 0.39]; [bilin- 
gual: familiar words M = 6.7s (SD = 2.8) and novel words 
M = 5.5s (SD = 2.9); t aT) = 2.2, p = 0.038; Cohen's d = 0.41]. 
Overall, results indicate that 6 month olds (monolinguals and 
bilinguals) can segment monosyllabic words from sentential con- 
texts and evidence for this ability is reflected in the familiarity 
preference response pattern observed for words in the test phase. 
This is actually the usual preference pattern obtained in this 
type of task and it differs from the pattern found with the older 
groups tested in this research using exactly the same material and 
procedure. 

A final analysis involving data from the two age groups 
was undertaken and only the age (6 vs. 8 months) per type 
of fist (familiar vs. novel) interaction was deemed significant 
[F(i, 102) = 40.4; p = 0.0001; i) 2 = 0.28], confirming the radi- 
cal change in the direction of preference that had taken place 
between the two ages under analysis. No other effects or inter- 
actions were found significant in this global analysis. We also 
extended the analysis to the attention time measures in the 
familiarization phase to check for any between-age differences 
in attention behavior to trials in the familiarization that could 
be related to the word preferences observed in the test phase. 
Results yielded no evidence of significant differences by age 
related to the attention time measures in the familiarization phase 
[F(i, 102) = 1-55, p = 0.21; r| 2 = 0.01], nor in the number of 
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FIGURE 2 | Mean attention time (s) and standard error to familiar and novel words presented in the test phase, for the 6-month-old infants, grouped 
by language environment (monolingual Catalan, monolingual Spanish and bilingual). 



trials to reach criterion [F(i t 107) = 1.34, p = 0.24; r| 2 = 0.14]. 
To sum up, although age differences did not seem to affect the 
behavior in the familiarization phase, they were determinant in 
the preference pattern observed in the test. 

DISCUSSION 

This research has explored young infants' emerging ability to 
segment simple, monosyllabic word-forms from fluent speech 
(natural language) in syllable-timed languages other than French. 
Six and eight-month-old participants growing up in Catalan 
monolingual, Spanish monolingual and Spanish-Catalan bilin- 
gual families were tested on a version of HPP, with familiarization 
to passages containing repetitions of two different target words in 
sentential contexts, and tested on words. Results revealed that all 
groups at both ages were able to successfully segment words from 
the passages and recognize them in the test phase. Critically, how- 
ever, the predominant response pattern obtained differed with 
age. While younger infants showed a familiarity preference, by far 
the most frequent pattern found in segmentation studies using 
natural language paradigms (Jusczyk and Aslin, 1995; Jusczyk 
et al, 1999a,b; Mattys et al., 1999; Mattys and Jusczyk, 2001a,b; 
Houston et al., 2004), older groups showed a novelty preference, 
i.e., a preference for the lists involving words not included in the 
familiarization passages. 

Familiarity or novelty preference patterns are usually 
attributed to the ease or difficulty to solve the task at hand 
(Hunter and Ames, 1988). As an example, in segmentation 
studies using artificial language paradigms evidence of segmen- 
tation is usually linked to a novelty preference for part-words 
over words in the familiarization material. This is no surprise as 
artificial languages in these studies are considered more simplistic 
than natural languages, syllabic sequences lacking the higher 
levels of variability found in any of the relevant dimensions in 
natural speech material (Pelucchi et al., 2009). Based on language 
complexity factors, word segmentation experiments run on 
natural speech material are thus likely to yield results showing a 



familiarity rather than a novelty preference and this is the pattern 
that a priori could be expected in our research. However, our 
study focused on simple elements (monosyllabic words) to be 
segmented from passages recorded in IDS style and participants 
were acquiring a language in which the rhythmic unit is the 
syllable (Nazzi et al., 2006), so even if the experiment was run 
on natural language material, the sum of all these factors may 
have contributed to simplify the task, thus leading to a novelty 
preference pattern in infants' responses. This can especially be 
true at older ages, when greater ability in word segmentation 
might have already been acquired. 

There are thus a number of factors that may have facilitated 
word segmentation in the populations under study. The use of 
IDS in the recording of the material is one of them. IDS has been 
described as having a slower rate of speech, longer pauses and 
greater pitch excursions favoring infant's attention to it (Fernald 
and Kuhl, 1987). This style also uses simplified sentence structures 
that together with prosodic exaggeration can facilitate speech 
processing and the extraction of units from the segmentation 
perspective. Support for this interpretation comes from a seg- 
mentation study in which nonsense sentences either with in ADS 
or IDS style were used to test segmentation ability in 7- and 
8-month-old infants (Thiessen et al., 2005). Results indicated 
that only in the IDS condition segmentation could be reached, 
thus the prosodic characteristics of IDS seem to have facilitated 
the extraction of word-form units from material that was other- 
wise equivalent in terms of the statistical cues that could be used 
for segmentation. In our material, where sentences in the pas- 
sages were about 7-8 syllables long and target words where often 
aligned to phrase boundaries, clearly demarcated by pauses, the 
extraction of the target elements from the passages was certainly 
facilitated (Seidl and Johnson, 2006). It is worth mentioning 
here that words in the passages and words in the lists differed 
in duration, as reported in Table 2. Variability did not preclude 
recognition of the target items: infants in any of the two age 
groups in this study did not fail to notice the correspondence 
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between the target words placed in sentential contexts and the 
words presented in isolation in the test phase, when duration 
was longer than when they were produced in sentential contexts. 
This result is similar to what has already been found in previous 
research using natural speech (Jusczyk and Aslin, 1995; Jusczyk 
et al., 1999a; Mattys and Jusczyk, 2001a,b), but it is reported here 
for infants tested at a younger age (6 months). 

Another factor favoring word segmentation in our research 
is related to the length of the words (monosyllabic CVC items) 
and the match with the rhythmic unit of the languages under 
study. This is actually a key issue in our research. From the early 
rhythmic segmentation hypothesis, syllabic units would play a 
determinant role at the onset of word segmentation for infants 
acquiring languages with a syllable-timed rhythm (Nazzi et al., 
2006). Thus, segmentation of monosyllabic word-forms should 
be easier in these languages than in languages belonging to a 
different rhythmic typology in which the rhythmic unit might 
not be the syllable. The fact that positive evidence for monosyl- 
labic word segmentation has been obtained at 6 months of age 
in either Catalan-learning and Spanish-learning infants suggests 
that the match between the rhythmic unit and the length of the 
target words in our study may have favored an early onset of 
the segmentation abilities in our populations. Recall here that 
English-learning infants succeeded at monosyllabic word seg- 
mentation with natural language material at 7V4 months of age 
but not earlier (Jusczyk and Aslin, 1995), unless highly famil- 
iar words such as the infants' own names preceded the target 
monosyllabic units facilitating the extraction of adjacent elements 
(Bortfeld et al., 2005). Without these or other additional cues to 
segmentation, English-learning infants seem to start segmenting 
simple word forms from fluent speech around 7 months of age. 

It is interesting to note that in spite of the presence of some 
differential features between Catalan and Spanish possibly affect- 
ing their rhythmic properties (vowel reduction and more complex 
consonantal codas in the former), these differences have not 
had any clear impact on the emergence of the segmentation 
ability for monosyllabic word-forms. Neither the monolingual, 
nor the bilingual groups in this research have shown significant 
differences in their behavior in the segmentation task. 

Data from other syllable-timed languages that could support 
the early rhythmic segmentation hypothesis are limited to French 
and mostly focused on bi-syllabic word segmentation (Nazzi 
et al., 2006; Goyet et al, 2010 for European French, and Polka 
and Sundara, 2012, for Canadian French) so no data are available 
regarding 6-month-old French-learning infants solving a word 
segmentation task. However, as mentioned in the introduction, 
these studies have reported a certain ease for syllabic segmen- 
tation compared to the segmentation of bi-syllabic words, so in 
spite of the differences and controversies when European and 
Canadian French segmentation studies are compared, there is 
some converging evidence about the facilitative role of the syl- 
lable as a unit for segmentation in these syllable-timed languages. 
But, finding differences between Spanish and Catalan on the one 
hand, and French on the other, on the early onset of word seg- 
mentation for monosyllabic units is also a possibility to be taken 
into account. Spanish and Catalan have contrastive and variable 
stress, a property not shared with French. Stress in French falls 



invariably on the last syllable of each word of phrase but it is 
actually mostly reduced in fluent speech; prominence of stressed 
syllables is very similar to their unstressed neighbors and only 
words in utterance -final position get some prosodic marking in 
the form of vowel lengthening (Tranel, 1987). The presence of 
variable stress in languages such as Catalan or Spanish can be a 
factor that enhances the perception of syllabic units, thus lead- 
ing to an earlier onset of the segmentation ability at least for short 
monosyllabic items. This remains an open question requiring fur- 
ther analysis, but the positive effect of variability in the input 
to the young learner has already been pointed out in research 
addressing different aspects of language acquisition. For instance, 
high levels of acoustic/phonetic variability deriving from the use 
of multiple exemplars (several tokens from multiple speakers) in 
a word learning task involving phonologically similar words led 
participants to successful learning while they failed in a more 
simple, single-exemplar condition (Rost and McMurray, 2009). 
Another example can be found in research showing that the learn- 
ing of non-adjacent dependencies is facilitated with decreasing 
predictability between adjacent elements, that is, the extraction 
of the invariant structure (the stable elements of a stimulus set) 
is actually easier with increasing variability of the irrelevant inter- 
vening elements (Gomez, 2002). Back to word segmentation in 
syllable-timed languages, it is possible to hypothesize that the 
presence of variable stress in the input may have enhanced the 
detection and extraction of monosyllabic word units. This is an 
issue to be further analyzed in future studies, where the facilita- 
tion effects of the syllable as the rhythmic unit for segmentation 
could be more carefully analyzed after controlling for other facil- 
itation effects derived from the paradigm, task demands, or the 
specific properties of the speech material in the test. 

The developmental change in the preference pattern obtained 
in our data, suggests rapid gains in segmentation ability for these 
short, monosyllabic units that match the rhythmic unit of the 
ambient language. Because the paradigm and material used in 
our experiments were exactly the same at both ages, the rever- 
sal of the preference pattern seems to confirm the ease to extract 
these short units from sentential contexts with increasing age. 
A reversal of the preference pattern had also been described in the 
literature (Thiessen et al., 2005), but in that case not only age but 
an extended familiarization phase were both factors that modi- 
fied the pattern obtained at an earlier age. This is not the case 
in our study as no manipulation of the paradigm was done. A 
simpler interpretation is thus that infants have gained expertise 
in segmenting fluent speech, especially regarding monosyllabic 
elements. The question remains whether similar results would 
be obtained for CV items, instead of CVC, and whether seg- 
mentation of function CVC or CV words in these languages, 
involving unstressed vowels, would also be successfully solved at 
an early age and by all language groups (the presence of vowel 
reduction in Catalan but not in Spanish may also contribute to 
differential results). This is clearly a topic to be explored in future 
research. 

The present paper has included participants growing up bilin- 
gual and their results deserve some comments. An early onset of 
monosyllabic word segmentation abilities has also been found 
in our Spanish-Catalan infant participants. The timing and 
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characteristics of their segmentation ability do not seem to dif- 
fer from results obtained in monolingual infants in our study, at 
least from a behavioral perspective. The bilingual results are rele- 
vant in that they are the first evidence of segmentation abilities in 
bilinguals acquiring languages with rather similar rhythmic prop- 
erties [so far, only preliminary data exist from French-English 
bilinguals showing bi-syllabic word segmentation ability in both 
languages by 8 months of age, as reported by Polka and Sundara 
(2003)]. Although bilinguals in our research are exposed to lan- 
guages that do not greatly differ in their rhythmic properties 
and, from this perspective, it could be predicted that no delays 
or differences in solving the segmentation task would be found, 
bilingual exposure might nevertheless lead to small differences 
in the developmental time-course of certain speech and language 
abilities, as for instance those found in the phonetic categoriza- 
tion domain (Bosch and Sebastian-Galles, 2003; Sebastian-Galles 
and Bosch, 2009). Even if the ambient languages do not show 
great differences in their rhythmic characteristics, segmentation 
abilities might have been slightly delayed in this population, just 
as a consequence of adaptive processes to cope with the more 
complex input the bilingual is exposed to. This was not the case in 
our data, with bilinguals showing parallel results to their mono- 
lingual counterparts. These data extend to the word segmentation 
domain the notion that bilingual exposure does not alter the 



pattern of acquisition as observed in monolingual populations 
(Werker and Byers-Heinlein, 2008). 

To sum up, results from this research (a) offer evidence of 
an early ability for monosyllabic word segmentation in syllable- 
timed languages such as Spanish and Catalan, not previously 
described in the literature; (b) reveal no differences between 
monolingual and bilingual participants in this task, probably 
because both languages in the bilingual environment share the 
same rhythmic properties; and (c) show a specific developmen- 
tal pattern that is compatible with an interpretation based on the 
facilitation effect that can be observed when rhythmic properties 
of the language match with the units to be extracted from flu- 
ent speech. These results should be the basis for further research 
exploring disyllabic word segmentation in the same linguistic 
population. They can also offer relevant information for future 
cross-linguistic research and they should be useful in studies com- 
paring normally developing infants and clinical groups at risk for 
language delays in speech segmentation tasks. 
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